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Abstract 

This paper investigated the phonological acquisition of English /s/ + consonant onset clusters by Korean learners of 
English as a Foreign Language (EFL) who varied in their levels of proficiency. The data were collected from twenty 
eighth-graders in a Korean secondary school, who were divided into two groups according to their proficiency: low- 
and high-achievement. The major findings were: 1) the high-achievement group did not use a vowel epenthesis strategy 
in the articulation of the /s/ + consonant onset cluster, unlike the low-achievement group; 2) the duration of /s/ 
pronunciation was longer in the high-achievement group, as follows: /s/ + stop + liquid, /s/ + stop, /s/ + liquid; 3) the 
low-achievement group’s duration of oral closure was longer than was that of the high-achievement group, as follows: 
/s/ + stop + liquid, s/ + stop, and 4) with regard to how /s/ + consonant onset clusters are perceived by native English 
speakers, /s/ + stop + liquid was related more significantly to the learners’ level of proficiency than were the biliteral 
consonant onset clusters. Among biliteral onsets, /s/ + stop and /s/ + liquid clusters differed significantly between the 
groups, while the /s/ + nasal cluster did not. 

Keywords: English /s/ + consonant onset clusters, LI transfer, the sonority sequencing principle, syllable structure, L2 
phonological acquisition 

1. Introduction 

As influenced by two primary approaches, first language (LI) transfer and markedness effects, the question pertaining 
to how certain phonological structures are produced by second language (L2) learners has been raised frequently. 
Specifically, with respect to consonant clusters, earlier research of LI transfer assumed that transfer leads to consonant 
cluster simplification (Broselow, 1984; Lado, 1957; Major, 2001). Expanding on the conformity with LI syllable 
structures, a number of later scholars emphasized that L2 speakers of English simplify consonant clustering to conform 
to universal linguistic principles (Klove & Young-Scholten, 2008; Osburne, 1996). 

Among consonant cluster types, the segment /s/ is especially noteworthy, /s/ is often regarded as having similar phonetic 
features across languages that leads to positive transfer (Koffi, 2011). Nonetheless, the pronunciation of /s/ in complex 
onset clusters is a tremendous challenge for L2 learners of English, because no two languages pronounce complex /s/ 
onset clusters in an identical manner due to the different phonotactic constraints of languages (Duanmu, 2002; 
Sperbeck, 2010). This situation also holds true for Korean and English, /s/ appears in a wide array of phonological 
environments in English, but not in Korean, as the latter does not allow consonant clusters within a syllable. In addition, 
many of the /s/ distribution patterns in English, unlike those in Korean, violate the Sonority Sequencing Principle. 

Thus far, little attention has been given to the way in which L2 speakers of English apply these rules that govern the 
pronunciations of the English /s/ in complex onset clusters. The majority of the previous literature that has investigated 
consonant clusters concerns modification strategies, such as consonant cluster deletion and vowel epenthesis. The order 
of acquisition of English consonant clusters (Yoo, 2004), and gender effects on the articulation of some English 
consonant clusters (Park, 2008), also have been investigated rarely. 

Therefore, this study examined the acquisition of English consonant clusters by Korean EFL learners of English. The 
study focused on onset in an attempt to determine the differences in acquisition of consonant onset cluster types with 
respect to the level of L2 proficiency. The research questions addressed were as follows: 

(1) Do Korean EFL learners with different levels of English proficiency pronounce complex onset clusters with a word- 
initial /s/ differently? 

(1-1) Is there a difference in the aspects of pronouncing triliteral onset with word-initial /s/ according to the level of 
English proficiency? If so, how do learners modify the pronunciation (i.e., s + stop + liquid) according to their level of 
proficiency? 

(1-2) Is there a difference in the aspects of pronouncing biliteral onset with word-initial /s/ according to the level of 
English proficiency? If so, how do learners modify the pronunciation of each cluster type (i.e., s + stop, s + nasal, s + 
liquid) according to their level of proficiency? 
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(2) Does the difference in pronunciation by Korean secondary school learners with different levels of English 
proficiency affect native English speakers’ judgment of accuracy in pronouncing complex onset clusters with word- 
initial /s/? 

2. Theoretical Background 

2.1 Cross-linguistic differences in syllable structures 

The concept that “transfer” occurs between LI and L2 has been one of the most important issues for linguists, and has 
been regarded as the major source of difficulties for L2 learners (Altenberg & Vago, 1983; Yavas, 2005). One example 
of a cross-linguistic difference is the syllable structures of languages. While the basic syllable structure is represented as 
the combination of a single consonant and a single vowel, i.e., CV, languages can have more consonants and different 
syllable structures, as shown in Table 1. Korean syllable structures do not allow two or more consonants to occur in the 
onset of a syllable, while English permits two or three consonants at the beginning of syllables. 


Table 1. Comparison of syllable structures of English and Korean 


English Syllables 

Korean Syllables 

CV 

CV 

cvc 

cvc 

VC 

VC 

V 

V 

cvcc 


ccvc 


ccv 


vcc 


ccvcc 


cccvc 


cccvcc 


cvccc 


ccvccc 


cccv 



The cross-linguistic differences between LI and L2 syllable patterns are challenging for L2 learners. They tend to use 
LI syllable structure patterns in the context of L2 by using vowel epenthesis and consonant deletion to modify 
consonant clusters (Major, 2001; Weinberger, 1997). Many researchers have claimed that Arabic EFL learners used 
epenthesis when a target L2 syllable structure is absent in their LI (Al-Shuaibi, 2006; Broselow, 1983, Kharma & Hajjaj, 
1989); Korean EFL learners inserted a schwa sound between English stop + liquid clusters (Kim, 2002; Tarone, 1980). 
On the other hand, Anderson (1987) insisted that the frequency of epenthesis or deletion strategy is positively related to 
the length of onsets. Yet, little was known about the extent to which L2 learners modify different consonant cluster 
types. 

2.2 Violation of phonotactic constraint: the Sonority Sequencing Principle 

The Sonority Sequencing Principle (SSP) is considered universal, as it applies to a large number of languages. Each 
phoneme is believed to have a sonority scale that shows the relative phonological prominence of different sounds, as 
shown in Table 2. According to the SSP, sonority must rise towards the syllable peak at onset. Although English 
complex onset clusters conform to the SSP in most cases, they violate it in such cases as complex onset clusters with 
word-initial /s/. The question addressed here is what happens when Korean L2 learners, whose LI conforms to the SSP, 
speak an L2 that violates it. 


Table 2. The arithmetic of sonority 


Sounds 

Sonority 

Scale 

[a, x, q] low vowel 

10 

[e, o] mid vowels 

9 

[i, u, j, w] high vowels 

8 

[r] rhotic 

7 

[1] liquid 

6 

[m, n, q] nasals 

5 

[s] sibilant 

4 

[v, z, 5] voiced fricatives 

3 

[f, 0] voiceless fricatives 

2 

[tf, d 3 ] affricates 

1.5 

[b, d, g] voiced stops 

1 

[p, t, k] voiceless stops 

0.5 


(Adapted from Goldsmith, 1990) 
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The SSP violation and its effect on L2 consonant cluster production have been investigated by several studies 
(Abrahamsson, 1999; Carlisle, 1991; Davidson, Jusczyk, & Smolensky, 2009; Eckman & Iverson, 1993). The results 
indicated that the sonority distance between consonants can render certain patterns of onset consonant cluster, which 
abide by the SSP, easier than others, which violate the SSP Yet, how L2 learners with different levels of proficiency use 
modification strategies according to varying sonority gaps in consonant clusters has been open to debate in the literature. 

3. Methods 

The study was conducted with twenty eighth-graders in a Korean secondary school. They were divided into two groups 
according to the results of the TOEIC Bridge® Test, i.e., ten were assigned to a low-achievement group and the other 
ten to a high-achievement group. All of them had studied English for more than five years in Korea without having 
lived in an English-speaking country. An independent samples t-test confirmed that the high-achievement group (M = 
165.80, SD = 11.44, N = 10) was significantly more proficient than was the low achievement group (M = 30.90, SD 
=12.71, N = 10), tis = 24.95, p < 0.001). 

Target consonant clusters (Table 3) were selected in accordance with the classification of the English consonant clusters 
(Prator & Robinett, 1985) and the word lists provided in the secondary school English textbooks. 


Table 3. Target sonority clusters 

Target Word 


Triliteral 

Onset 

s + stop + liquid 

Istrl 

Iskrl 

/spr/ 

/spl/ 

street, stream 
script, scream 
spring, spread 
split, splendid 

Biliteral 

Onset 

s + stop 

/sk/ 

/st/ 

/sp/ 

skip, school 
style, study 
spoon,speak 

s + nasal 

Isml 

/sn/ 

smile, small 
snow, snail 

s + liquid 

/si/ 

slave, slight 


Two tests, the Production and Perception Test, were implemented to investigate the acquisition of onset consonant 
clusters. For the Production Test, a Word Reading Task was used, in which the learner participants were given a word 
list on paper and asked to read the words as naturally as possible. They were allowed first to go through the words for 
approximately three minutes, and to ask questions regarding any words with which they were unfamiliar. After the 
recordings were uploaded, the Praat software program (Boersma & Weenik, 2012) was used to analyze the speech 
sounds. To identify the cluster and the /s/ boundaries, the onset and offset of /s/ was marked with a boundary where the 
high frequency energy appeared and ended. The oral closure was marked at the point at which the intensity dropped 
distinctly, with a loss of energy in the higher formants. During segmentation, if it was difficult to determine a boundary 
point, the spectrogram display and auditory judgment were used. After the speech was segmented and labeled, the 
duration of every /s/ and one of the oral closures was measured. All interval durations were also investigated to see 
whether vowel insertion and epenthesis occurred between consonants and in the starting position of syllables. Together 
with the acoustic properties of the spoken data, human rating methods were implemented. Four hundred speech tokens 
(twenty words * twenty subjects), two hundred for the low-achievement group and two hundred for the high- 
achievement group, were presented for rating by two native English speakers who had more than three years of teaching 
experience and evaluated the recorded data independently. The two native raters also participated in evaluating the 
Perception Test, and rated the total set of speech data on a 5-point scale where 1 = “very poor accurate English /s/ + 
complex onset clusters,” and 5 = “very excellent accurate English /s/ + complex onset clusters.” 

4. Results 

4.1 Production test 

4.1.1 Vowel epenthesis 

Epenthesis was assessed by scanning the spectrogram and the waveforms visually. The researcher looked for a sharp 
change in energy at the onset or offset of the clear formant structure, and checked the frequency of the formants by 
reference to Ramirez’s (2006) study to detect the epenthetic vowel. For example, the first, second, and third formants of 
the epenthetic l\l had means of 569.17, 1944.00, 3039.95 Hz, respectively. Vowels normally have three different 
formants that seem to differ from consonants, which appear as noticeable dark bands based on the vowel position. 

Three students in the low-achievement group produced vowel epenthesis when a syllable began with Is/ and was 
followed by a voiceless stop either in triliteral or biliteral onset (compare Figures 1 and 2). For the high-achievement 
group, vowel epenthesis did not serve as a strategy in the articulation of English I si + consonant onset clusters. In 
addition, there was a significant difference in the I si durations, as well as the durations of the oral closure between both 
groups, which will be discussed in the next section. 
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-0.6579 
5000 Hz 



1852 Hz 
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skip 


O 424399 


Visible part 0.424399 seconds 


Total duration 0.424399 seconds 



Figure 1. The word skip without vowel epenthesis by a high-achievement student 


0.503696 



Total duration 0.503696 seconds 


Figure 2. The word skip with vowel epenthesis by a low-achievement student 
4.1.2 Duration of /s/ and oral closure 

The mean durations for both groups were calculated to investigate how the high- and low-achievement groups differed 
in their production of each type of cluster. An independent sample t-test was conducted to interpret the effects of group 
differences in the production of /s/ and oral closure. As shown in Table 4, the high-achievement group differed 
significantly from the low-achievement group in terms of their duration of production of /s/ in triliteral and biliteral 
onset with word-initial /s/, except /s/ in /s/ + nasal. The duration of pronunciation of /s/ by the high-achievement group 
was approximately 234% (/s/ + stop + liquid), 52% (/s/ + stop), and 12% (/s/ + liquid) longer than that in the low- 
achievement group. 


Table 4. Mean /s/ duration (in s) for high and low-achievement groups 



Triliteral Onsets 

Biliteral Onsets 


/s/ in /s/ + stop+ liquid 

/s/ in /s/ + stop 

/s/ in /s/ + nasal 

/s/ in /s/ + liquid 

High 

Low 

M SD t 

0.21 0.01 

0.06 0.02 

M SD t 

o.n 0.02 

0.15 0.02 U ' UU 

M SD t 

°- 14 °- 01 0 122 
0.13 0.01 

M SD t 

°' 14 °- 01 0 017* 

0.12 0.02 


*/? < 0.05 


In addition, to determine the quantitative relationship of the /s/ sound to the entire utterance of a target word, the 
percentage of the duration of /s/ pronunciation was calculated to analyze the association between the students’ 
proficiency and the mean /s/ duration, while controlling each student’s latency to pronounce a given target word. The 
process of calculating the percentage entailed dividing the time taken to pronounce the /s/ sound by the total time taken 
to pronounce a target word that contained the calculated /s/ sound. 


Table 5. Mean /s/ duration (in %) for high and low-achievement groups 


Triliteral Onsets 

Biliteral Onsets 

/s/ in /s/ + stop+ liquid 

/s/ in /s/ + stop 

/s/ in /s/ + nasal 

/s/ in /s/ + liquid 

x 2 P 

20.00 0.018* 

x 2 P 

20.00 0.029* 

x 2 P 

6.53 0.366 

x 2 

20.00 

P 

0.045* 


* p < 0.05 


As shown in Table 5, the results were similar to those in Table 4. The percentage of the duration of pronunciation of /s/ 
in the /s/ + stop + liquid cluster was approximately 38% for the high-achievement group and 6% for the low- 
achievement group. The /s/ + stop cluster and /s/ + liquid cluster were approximately 29% and 39% for the high- 
achievement group, and 15% and 29% for the low-achievement group, respectively. Thus, the percentage of the 
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duration of pronunciation of /s/ by the high-achievement group was approximately 533% (Is/ + stop + liquid), 93% (/s/ 
+ stop), and 34% (/s/ + liquid) longer than that of the low-achievement group. 

Because the data above showed a significant difference in the /s/ duration between the two groups and the spectrograms 
demonstrated quite a difference when scanned visually, the duration of the oral closure in the clusters was analyzed 
further. A significant difference was observed in the mean duration between the high and low-achievement groups when 
/s/ was followed by a stop sound (see Table 6). Contrary to the results for the duration of pronunciation of /s/ above, the 
mean duration of oral closure for the low-achievement group was found to be relatively longer than that of the high- 
achievement group. The low-achievement group was approximately 295% (Is/ + stop + liquid), and 154% (Is/ + stop) 
longer than was the high-achievement group. In addition, when the percentage of the duration of oral closure was 
calculated, the mean duration of the low-achievement group (38% for /s/ + stop + liquid; 33% for /s/ + stop) was also 
significantly longer than that of the high-achievement group (9% for /s/ + stop + liquid; 13% for /s/ + stop). 


Table 6. Mean duration of oral closure (in s) for the high and low-achievement groups 



Triliteral Onsets 

Biliteral Onsets 


Oral Closure 
in /s/ + stop+ liquid 

Oral Closure 
in /s/ + stop 

Oral Closure 
in /s/ + nasal 

Oral Closure 
in /s/ + liquid 

High 

Low 

M SD t 

°- 04 °- 02 0 000* 

0.17 0.08 

M SD t 

°.° 6 0.05 

0.15 0.04 

M SD t 

M SD t 


*/? < 0.001 


An example of the findings above is provided below in Figures 3 and 4, which show two pronunciations of the word 
script. The waveform and spectrogram show clearly that for a student in the high-achievement group, the /s/ is longer 
and the oral closure is shorter than are those of a student in the low-achievement group. 


0 543968 



-0 989 
5000 Hz 


Word 

(1/D 


script 



Visible part 0.503853 seconds 


0.543968 


Total duration 0.543968 seconds 


Figure 3. The word script without oral closure by a high-achievement student 


0.847120 



0 Visible part 0.847120 seconds 0.847120 

Total duration 0.847120 seconds 


Figure 4. The word script with oral closure by a low-achievement student 


4.2 Perception test 

To test the hypothesis that the difference in pronunciation caused by the proficiency level affects teachers’ judgments of 
accuracy, we examined whether there was any relationship between the score of each English /s/ + complex onset 
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cluster production by the two groups according to their L2 proficiency. With respect to the ratings of the samples, the 
dependent variable was the mean of the accuracy ratings that were calculated by averaging the two English listeners’ 
ratings for the twenty Korean subjects. When the intra-class correlation coefficient measured the degree of inter-rater 
reliability for each group of raters’ evaluations of the subjects’ speech, the raters’ coefficients were highly correlated, r 2 
= 0.90, p < 0.001, thus indicating high levels of agreement among the native English speaking raters. The Pearson Chi- 
Squared statistic was employed for statistical analysis of the association between students’ proficiency level and their 
perception test scores. According to the type of complex onset clusters, either the scores were, or were not, related 
significantly related to the proficiency levels. Triliteral onset with word-initial /s/ showed a much more significant 
association (x 2 = 17.33, p < 0.05), than did biliteral onset. Among biliteral onsets, the score for the /s/ + stop clusters (x 2 
= 14.29, p < 0.05), was the most significantly different, followed by the /s/ + liquid cluster (x 2 = 13.46, p < 0.05). 
However, /s/ + nasal clusters (x 2 = 5.91, p > 0.05), did not differ significantly. 

5. Conclusions 

The findings overall showed that, compared to biliteral onset, triliteral onset with the word-initial /s/ was associated 
significantly with the (a) acoustic properties of pronunciation and English proficiency, and (b) ratings and English 
proficiency. As the syllable structure rules in Korean do not allow two or more consonants to occur in the onset of a 
syllable, it was presumed that the Korean L2 learners have even more trouble when three consonants rather than two 
occur at the beginning of syllables. More interestingly, the results of this study are related to biliteral onset with word- 
initial /s/. The SSP theory has been adopted by many researchers, such as Carlisle (1991), and Eckman and Iverson 
(1993), as a theoretical framework to justify consonant cluster production. Because the sonority distance in the /s/ + 
nasal clusters is smaller than in the /s/ + liquid, the SSP would predict that /s/ + nasal clusters would be more difficult 
for Korean L2 learners. However, this study demonstrated that the SSP does not always hold true. The finding that Is/ + 
stop clusters, among other cluster types of biliteral onset with word-initial /s/, showed the most significant association 
between the (a) acoustic properties of pronunciation and English proficiency, and (b) ratings and English proficiency, 
could be explained by the SSP. Yet, in contrast to sonority distance, /s/ + nasal clusters did not differ significantly, 
while /s/ + liquid clusters did. Here, the well-known phenomenon that many Korean L2 learners have difficulty in 
differentiating between the /r/ and III sound, which is seen commonly as evidence of negative LI transfer because 
Korean lacks the phonemic and phonetic distinctions between [r] and [1] found in English, should be taken into account 
(Kang & Ahn, 2013; Kang, 2015). Thus, the reason that learners in the low-achievement group were less proficient in 
/s/ + liquid clusters than in /s/ + nasal clusters might be relevant to the problematic segment III. In addition, the learners 
who had difficulty pronouncing biliteral onset with word-initial /s/ and a stop sound were found to struggle with 
triliteral onset with word-initial /s/ (i.e., s + stop + liquid), but not the converse. 

The results of this study have many pedagogical implications as the beginning of words is crucial for accurate word 
recognition and latencies in word recognition of 250-300 milliseconds may cause a continuous breakdown in 
communication (Byrd & Mintz, 2010). Among three cluster types, i.e., s + stop, s + nasal, s + liquid, this study 
suggested that the pronunciation of /s/ in /sk/, /sp/ and /st/ in onset clusters requires more attention in L2 acquisition 
studies than do others, such as /sm/, /sn/, or /si/. In the classroom, L2 teachers should be encouraged to exaggerate the 
initial /s/ as one pedagogical technique to deal with /s/ in onset consonant clusters (Celce-Murcia et al., 2010; Koffi, 
2011). For example, the initial /s/ in words such as, skip, study , and speak can be elongated as /sssskip/, /sssstAdi/ and 
/sssspi:k/. Further, words like street , script , and spring can be practiced as /sssstri:t/, /sssskript/ and /sssspriq/. More 
research on English consonant onset clusters is needed to benefit L2 learners of English, and specific materials must be 
developed to facilitate pronouncing English in a natural and effective way. 
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